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The clustering coefficient, path length and average vertex degree of two urban train line networks 
have been calculated. The results are compared with theoretical predictions for appropriate random 
bipartite graphs. They have also been compared with one another to investigate the effect of 
architecture on the small-world properties. 
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I. INTRODUCTION 

Since model small-world networks were first proposed 
some five years ago 0, 0) to interpolate between the 
properties of regular and random graphs, many investi- 
gations have been carried out which have confirmed that 
the small-world phenomenon occurs in a variety of real- 
world settings. 

There are two common features which make appar- 
ently very different networks all "small- world" 0. First, 
the characteristic path length between vertices is short, 
compared to what might be expected based on the total 
number of vertices. This is the phenomenon which has 
entered popular culture as "six degrees of separation" . 
The second feature, the one we are noting when we agree 
with a new acquaintance that the world is indeed small, is 
that two vertices both linked to a third vertex are likely to 
have a direct link also. This feature has been called clus- 
tering, and while random graphs have the first feature, 
they do not have the second. Comparisons of real-world 
networks with random graphs having the same number of 
vertices and edges indicate much higher clustering occurs 
in real, small worlds. In a recent review these real- 
world networks are roughly grouped as social, biological, 
technological and information networks. 

Among the first social networks to be studied in this 
context are the Hollywood network of actors 0,0 j linked 
by having appeared in the same movie, and the coUobora- 
tion network of scientists linked by having published 
together. By considering not just the people involved but 
also the media that created their links, the network can 
be seen to have a bipartite structure 5] , because nodes of 
one kind (actors/authors) can only be directly linked to 
nodes of the other kind (movies/papers). In contrast to 
the usual random graph, unipartite or one-mode projec- 
tions of random bipartite graphs onto vertices of the one 
kind inherit some amount of in-built clustering, from the 
links to common vertices of the other type It is thus 
more appropriate to compare the path length and clus- 
tering of such a real world network to the averaged prop- 
erties of random bipartite graphs with the same number 
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of vertices of each kind and the same probability distri- 
bution of links from one to the other type . 

The study reported here was motivated by two quite 
different investigations into small- world properties of rail- 
way networks, one type of technological distribution net- 
work. The first study (of the Boston subway network) 
takes as its definition of an edge that a physical piece 
of track should connect two vertices (stations). This 
presents immediate difficulties at the end of a line, where 
the concept of a local clustering coefficient becomes moot 
and the definition (see Q below) undefined. (To resolve 
this problem, the authors propose that different proper- 
ties should be studied; rather than clustering coefficient 
and average or greatest path length, they define local 
and global ejficiency, 

.nd cost m- Other authors have 
chosen not to include such vertices in their calculation.) 

In the second study H^, of part of the railway net- 
work of India, two stations are defined to be linked if it 
is possible to embark at the first and reach the second 
without having to change trains, regardless of the num- 
ber of intermediate stations, i.e., they lie on the same 
line or route. This definition removes the "end of the 
line" problem noted above. However, it imposes the same 
kind of bipartite structure .5] as is present in the original 
study of the Hollywood actors network P, Q , and stud- 
ies of the collaboration network of scientists 0, Q and 
of Fortune 1000 directors (sit together on the board of a 
company) [sl lll|. However, this bipartite structure was 
not referred to in the analysis provided in 10], unlike the 
somewhat more whimsical study of the (artificial) Marvel 
comic book universe Q, our third motivation. 

Therefore, we wanted to investigate railway networks 
from this bipartite viewpoint, comparing the proper- 
ties of networks having different architecture with each 
other, and with those of the appropriate random bipar- 
tite model. The two networks we have chosen are Boston 
[l^ . because of the earher study, and Vienna be- 
cause of different features in its layout. In the following 
section we give the definitions of the network properties 
we calculate. In Section Fill Al we explain features of the 
systems that we studied and give our results. The paper 
closes with discussion and a some conclusions. 
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II. NETWORK PROPERTIES 
A. Definitions 

There are two different definitions of the measure of 
chquishness or clustering in a network 0,0] of N nodes, 
which attempt to quantify the same property (but which 
have not always been clearly distinguished). They both 
take values in the interval [0, 1]. The first definition 
which is convenient for analytic calculations, corresponds 
to what sociologists refer to as transitivity, and will be 
denoted by C^^': 



3 X number of triangles in the network 
number of connected triples of vertices 



(1) 



The second definition is easier to apply to an actual 
network, i.e., numerically. Chronologically, this defini- 
tion preceded ^ in the small- world literature 0|. If 
node V has ky neighbors, there are ('^2") possible connec- 
tions between them. The local clustering coefficient of 
node V is the fraction of these that are actually present: 

^ 2 X number of links between neighbors of v 
ky [ky 1) 

and then the clustering coefficient of the whole network 
is the average of the C^: 



(3) 



The number of other vertices to which a vertex v is 
linked is its degree ky and the average degree for the 
network is z. If there are K edges in the network, clearly 



K 



Nz 



(4) 



since each edge adjoins two of the vertices. 

Except in limiting cases, it is probably not important 
to make too much of the differences between ^ and (O ; 
after all, there are several different measures of network 
size in common usage. Define the distance dij between 
two nodes i,j to be the length (in number of edges) of 
the shortest path between them. The diameter D of 
a network is then the greatest distance between any two 
randomly chosen nodes. The average path length L is the 
average of the distances c?.y between all pairs of nodes: 

^- N-{N-iy 

Analytically, it proves useful to define |^ a typical path 
length £ which is determined by finding when the sum of 
the average number of first zi{= z), second Z2 , ■ ■ ■ , zi 
neighbors of a vertex reaches N: 



N. 



(6) 



Despite their differences, these various measures of 
network size scale as log(A'^) for a random graph; since 
there is no preferential attachment, the clustering coeffi- 
cient of a random graph (by either measure) is of order 
Crandom = 0(iV~^) Q . Morc precisely, it is a result 
of some twenty year's standing in random graph theory 
that £ = \os(N)/log(z), while the clustering coefficient 
was shown I3I to be 
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ndom 



N{N - !)■ 



(7) 



The conditions for a network to be small-world are that 
the characteristic path length is comparable to that of a 
random graph (with the same N and z) while it displays 
significantly more clustering. 



B. Formulae for random bipartite networks 

For a bipartite network, the number of nodes N refers 
to the vertices of the one- mode projection (e.g., the actors 
or, in this case, the stations). The number of nodes of the 
other type (train lines) will be denoted M, the average 
number of train lines a station lies on is fi and the average 
number of stations per line is v so that 



(8) 



The number of links, K , refers to the unipartite graph. 

Newman et al. have given expressions for average 
values of the network properties defined above, for vari- 
ous distributions of vertex degree, in terms of their gen- 
erating functions. In the case of bipartite graphs, the 
appropriate expressions are summarized below. 

For specific real- world graphs, such as these two train 
networks, one knows the degree of each vertex, so that 
the generating functions are polynomials. Let pj be the 
probability that a station appears on j train lines, and 
Qk be the probability that a train line has k stations on 
it. Then the two generating functions needed are: 



(9) 



Defining further generating functions in terms of these 
(their interpretation is given in |^ but is not needed for 
calculations) 

h{x) = i/^(a;), g^ix) = ^g',{x), (10) 

then the number of first and second neighbors of a station 
chosen at random are 



^i = /o(iK(i), 

Z2 = mf[m[{i)?; 

the typical path length © is 

\og[{N - 1)(Z2 - 21) + zf] - logzf 



£■ 



I0g(z2/Zi) 



(11a) 
(lib) 



(12) 
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and the clustering coefRcient ^ is 



(13) 



Here Gq (x) has interpretation as the generating function 
of the probabihty distribution of number of first neigh- 
bors (i.e., degree) on the unipartite graph of stations, and 
is defined by 



Go(x) = /o(.9i(x)). 



(14) 



Of course, the role of the nodes could be interchanged, 
i.e., a projection made onto a one-mode graph of train 
lines rather than stations. In the case of company di- 
rectors, studying the interlock of Fortune 1000 compa- 
nies (rather than degree distribution for the network of 
directors) has provided explanation for the adoption of 
common corporate policies and practices [TTI | . 



III. THE SMALL WORLDS OF BOSTON AND 
VIENNA 

A. This study: stations and trains 

In early sociological studies of networks, getting a good 
definition of connection was sometimes problematic 
While for technological networks this is generally less of 
a problem, recall the definition of a link between two 
stations used in this study: that one can travel between 
them without changing train. The railway networks were 
treated as undirected and unweighted; if more than one 
train line can be used to travel from a particular station 
to a second, with no change of train, in either direction, 
this counted as a connection. Further, no weighting was 
made for timetable information, such as the number of 
trains running on a line per day or whether trains ever 
"run express" . 

The subway network of Boston consists of four lines, 
serving 124 stations The network is decentralized, 

in that no station lies on all four lines, thus having every 
other station as a direct neighbor with our definition of 
adjacency. In fact, the highest value of fc^, is 97. The 
diameter of the network is D = 3. One line splits into 
two branches and another into four. Consistent with our 
definition of connectivity, these were treated as separate 
lines, giving eight in total, some of which have a series of 
stations in common. 

There are five lines serving 76 stations in the U-Bahn 
network of Vienna, which is also decentralized. These 
lines interact with each other at single stations, though 
possibly more than once due to their tendency to follow 
curved (rather than radial) paths. The highest degree is 
ky = 43 and the diameter is also 3. 

From the network maps, the value of dij for each pair 
of stations was entered into a spreadsheet. Using vari- 
ous spreadsheet functions, the degree of each station fci,, 
the total number of links K, the average degree z = zi. 



the clustering coefficient C as defined in (jSJ, and the av- 
erage path length L defined in were calculated 0- 
By counting the instances when dij — 2, 3, the average 
number of second Z2 and third Z3 neighbors could also be 
determined. These 'Actual' values appear in Table 

Also from the maps, the discrete probability distribu- 
tions pj and Qk required for using the generating functions 
were identified. These appear in Tables |n] and IIIII 
The mean properties of random bipartite graphs with the 
same distributions of links from one type of node to an- 
other, zi, Z2, i and C^"'"-', were calculated using equations 
(|11|I - H13|) . and appear in TableQlin the 'Theory' columns, 
together with the total number of (predicted) links, cal- 
culated using zi in Finally, using (|T|) , Crandom WaS 
calculated for the "usual" random graph. 

Having /o(a;) and go{x) exactly, Maple was used to 
find the generating function for the probability distribu- 
tion Vz of number of stations z a randomly chosen station 
is linked to: 



Go{x) = foigiix)) 



r,x 



(15) 



By swapping the roles of / and g the probability distri- 
bution for interactions (i.e., interlock) between train lines 
could also be determined. 



B. Discussion 

From Table m both networks satisfy the two basic con- 
ditions for being small-world. The size of the networks, 
measured by diameter (D — 3 for both) or by average 
path length L, is small when compared to the number 
of vertices. The actual clustering coefficients are much 
greater than Crandom for appropriate N, K. While in 
relative terms the ratio C/Crandom is orders of magni- 
tude smaller than, say, that observed for the Hollywood 
network P, |^, it is comparable with that observed in 
studies of networks of similar size as listed in Table ITTI 

However, in absolute terms, for both networks C is 
very much higher than the values reported in other stud- 
ies. Our definition of connectivity has resulted in C being 
so close to 1 because of the proportion of stations in each 
network (> 80%, see TableEll which lie on a single train 
line. All stations to which they are connected are also 
linked to one another, forming a clique, and each giving 
C„ = 1 as their contribution to ^ . The most useful com- 
parison may be to the railway network of India [l(| which 
has C = 0.69. A country- wide train network is not con- 
strained as an urban subway network is, and train routes 
criss-cross between stations in different patterns, giving 
lower local clustering. 

However, some amount of this clustering is expected 
solely on the basis of the underlying bipartite structure 
0, and the calculation of C^"^' gives a measure of it. 
Although they are not precisely the same quantity, com- 
paring the actual value of C to the theoretical C^"^' for 
the same distributions pj and qk shows that the Boston 
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Actual 


Boston 


Theory 


Actual 


Vienna 


Theory 


Number of stations, A'^ 




124 






76 




Number of train lines, M 




8 






5 




Number of links, K 


1711 




1936 


785 




788 


Average degree, zi = z 


27.597 




31.226 


20.658 




20.737 


Mean 2nd neighbors, Z2 


91.048 




766.75 


43.842 




97.213 


Mean 3rd neighbors, Z3 


4.355 






10.500 






Network size, L, £ 


1.8110 




1.4187 


1.8646 




1.7236 


Clustering coefBcient, C, C*-^' 


0.9276 




0.4808 


0.9450 




0.7989 


C'random 






0.2244 






0.2754 



TABLE I: Network properties for each train network. The results in the 'Actual' columns were obtained from the network 
maps (by spreadsheet calculations). Apart from Crandom, the results in the 'Theory' columns were obtained from the formulae 
in Section fll BI for random bipartite graphs with appropriate specified probability distributions, using Maple. 



jj 1 2 3 4 5 6 



Boston 100 12 3 
Vienna 67 8 1 


5 


3 


1 


TABLE IL Number of stations lying 


on j lines. 


i.e., Npj 




k: 7 12 14 18 19 


20 21 


24 


33 


Boston 1 11 
Vienna 1 1 


1 

1 1 


3 
1 


1 



TABLE IIL Number of lines containing k stations, i.e. Mqk- 

network appear to have more genuine or "excess" cluster- 
ing. For comparision, in Table Ivl some other values from 
the literature are listed. In social networks, this clus- 
tering is explained by people introducing their friends or 
collaborators to one another P, |^ while for the artificial 
world of Marvel comics it arises from the way that the 
(6 486) characters are not distributed randomly across 
all the (12 942) comic books but appear in teams, in se- 
ries 0. The Vienna network has a structure similar to 
that originally proposed to model small- worlds P, 0] : it 
consists of cliques of connected nodes (stations on a sin- 
gle line) which interact with one another through certain 
nodes (stations common to two lines) which connect the 
network but have very low values of Cv On the other 



Network 


N 


K 


C / Crandom 


Reference 


Boston 


124 


1711 


4.13 




Vienna 


76 


785 


3.43 




Dolphins 


64 


159 


3.74 


fl4] 


China airports 


128 


1165 


5.13 




C. elegans 


282 


1974 


5.60 




Hollywood 


450,000 


2.55x10^^ 


789 


f5] 



TABLE IV: Actual clustering coefficients compared to 
Crandom for appropriate network size A'' and number of links 
K. 



Network 


C (actual) 


C'"^^ (theory) 




Ref. 


Boston 


0.9276 


0.4804 


1.93 




Vienna 


0.9450 


0.7989 


1.18 




Fortune 1000 


0.588 


0.590 


1.00 




Physics co-authors 


0.452 


0.019 


2.35 


m 


Hollywood 


0.199 


0.084 


2.36 




Marvel 


0.192 


0.0066 


29.09 





TABLE V: Comparison of actual and theoretical clustering 
for a number of one-mode networks with underlying bipartite 
structure. 



hand, because of the branched lines in the Boston net- 
work, whole groups of stations are shared between lines, 
overlapping the cliques and enhancing for the shared 
vertices. 

The average degree z as predicted by the random bi- 
partite theory agrees well with the actual observed values 
(see Table nj. Using z to calculate the total number of 
links K shows that this agreement is better for Vienna 
than Boston, where the prediction is for an extra 200-odd 
links between the 124 stations. This is also attributable 
to the effect of the branched lines, causing stations to in- 
teract repeatedly with groups of common stations, rather 
than having links to as many new stations as might be 
expected based on the number of lines on which they lie. 

The worst discrepancy between the predictions of the 
random bipartite model and the actual results was in es- 
timating the number of second neighbors Z2. In both 
cases, the predicted number exceeded the actual average 
number of second neighbors. This is a flow-on from the 
high local clustering; if a node and most of its neighbors 
form a clique, there are few paths of length two. But the 
theoretical Z2 also exceeded the total number of nodes in 
the network, in the case of Boston by an order of magni- 
tude! 

There are several limitations to our study which may 
have affected how the results compare with the bipar- 
tite theory. The small-world effect is notable in systems 
which are not only connected and decentralized (as ours 
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are) but also large, and sparce Though the value of 
is not extremely large, it is comparable to the other 
studies cited in Table Hvl Sparceness refers to the num- 
ber of links K in comparison to the number of nodes N, 
specifically that 

K <^ or z«iV-l. (16) 

We have z/{N—l) « 0.25 which is not particularly small, 
and is certainly larger than the studies cited in Table HVl 
One feature of our study which appears unique is that 
M/N w 1/15. For other networks which have been stud- 
ied from the bipartite point of view this ratio takes val- 
ues: 0.5 (Marvel universe @), 1.0 (Indian railroad Il0|), 
2.0 (Hollywood 0, 0), and 8.9 (Fortune 1000 0, [n|). 
We speculate that the balance of the various sizes {N, 
M, K) may be the main problem with the second degree 
estimate, but since no other authors have calculated or 
commented on 22 in their studies we cannot offer further 
support for this explanation. (We hence do not give too 
much attention to the calculated value of £ since it relies 
on both N and Z2 ) 

Finally we comment on the distributions of linked sta- 
tions and interlocked train lines, though we do not give 
the corresponding long but rather sparce tables of actual 
values. The distribution from (|15|l for both Vienna 
and Boston corresponded closely to the actual values, 
with some smoothing and a long, low tail as would be 
expected. The analogous generating function, for how 
many lines a randomly-chosen line shares stations with, 
gave reasonable agreement with the actual values for Vi- 
enna. However, the expression for Boston suffered from 
a similar problem to the Z2 results referred to above; it 
was a very smeared-out distribution which gave the high- 
est probability to 22 lines, though actually the maximum 
possible is M — 1 = 7. Presumably this problem is due 
not so much to M being small (since Vienna would have 
had that problem too), but mainly to the unusual actual 
distributions which result from the common stretches of 
line, which the averaged distribution from the random 



model struggles to mimic. In this context, it should be 
noted that the theoretical interlock of company directors 
was not in close agreement with the actual data, and 
the proferred explanation is similar (when translated into 
more human terms). 



C. Conclusion 

To quote the paper Q that introduced analysis based 
on bipartite structure to the small-world literature: "[It] 
is perhaps best to regard our [bipartite] random graph 
as a null model — a baseline from which our expectations 
about network structure should be measured. It is de- 
viation from the random graph behavior, not agreement 
with it, that allows us to draw conclusions about real- 
world networks. " Although there have been other studies 
of small- world properties of transport networks p!^ , in- 
cluding railway networks Q , this study appears to be the 
first that has not only had underlying bipartite structure 
01, but has used this in its analysis. Further, because 
we have studied two networks at once, we have been able 
to draw comparisons between them based on their dif- 
ferent train-line architecture. We have seen properties in 
common, such as high C, properties close to prediction, 
e.g., z, and properties which vary from the random bipar- 
tite graph model, and we have attempted to explain the 
behavior in terms of our definition of connectivity and 
how this works itself out in the actual networks. This 
study extends the application of bipartite analysis from 
social affiliation networks to the technological; pre- 
sumably there are examples to be found also among real- 
world information and biological networks. 
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